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ABSTRACT 

We have used integral two-point spatial correlation function and its second deriva- 
tive to analyze the distribution of quasars in three very deep surveys published in the 
literature. Statistically significant (~ 2 — 3a) correlations were found at scales of 
~ 50 — 100ft. -1 Mpc in all of the analyzed surveys. We have used the friend-of-friend 
cluster analysis to show that these correlations can be explained by the presence of 
relatively small quasar clusters (3 — 6 objects) which may possibly belong to larger 
structures such as Large Quasar Groups found in the bigger surveys. The sizes of 
these clusters along the redshift direction and distances between them are similar to 
those for structures found recently in studies of CIV absorption systems. These re- 
sults present further evidence for the existence of large-scale structures at redshifts 
z ~ 1 - 2. 

Key words: large-scale structure of Universe - galaxies: quasars: general. 



1 INTRODUCTION 

Quasi-stellar objects (QSOs) have proved to be very useful 
cosmological probes of the high-redshift Universe. The avail- 
ability of large homogeneous surveys has made it possible 
to study the spatial distribution of quasars using statisti- 
cal tools such as the two-point spatial correlation function. 
These studies have revealed that quasars are strongly clus- 
tered at scales of r < 20/i _1 MpdJ (e.g. Iovino et al. 1991; 
Andreani & Cristiani 1992; Mo & Fang 1993; Romberg et 
al. 1994; Shanks & Boyle 1994 and references therein). Al- 
though there is still an ongoing discussion about whether 
these correlations are evolving with redshift, the fact that 
the amplitude and shape of the quasar correlation function 
are roughly similar to those of low-redshift objects (e.g. An- 
dreani & Cristiani 1992; Mo & Fang 1993; Romberg et al. 
1994; Shanks & Boyle 1994) shows that quasars may possi- 
bly be used as tracers of the matter distribution at medium 
redshifts (z ~ 1 - 2). 

In addition to the statistical results, evidence has been 
found for structures in the quasar distribution at even larger 
(~ 100 - 150/i _1 Mpc) scales (Webster 1982; Crampton, 
Cowley & Hartwick 1987, 1989; Clowes & Campusano 1991a, 
1991b; Graham, Clowes & Campusano 1995; Romberg et 
al. 1996). These structures, which we call Large Quasar 



All the quoted scales are comoving and were calculated assum- 
ing flat Friedman cosmology with Hq = lOO/i km/s/Mpc and 
A = 0. 



Groups (LQGs), consist of ~ 10 — 25 quasars and have 
sizes and density contrasts similar to those of nearby super- 
clusters of galaxies (Romberg et al. 1996). Recently, similar 
high-redshift structures have been found in the distribution 
of absorbers (e.g. Jakobsen et al. 1986; Sargent & Steidel 
1987; Jakobsen & Perryman 1992; Dinshaw & Impey 1996; 
Williger et al. 1996) This indicates that supercluster type 
structures are observed at redshifts ~ 0.5 — 2.5 - the fact 
which may be used as a useful constraint for existing models 
of structure formation and evolution. 

If the quasar large-scale structures are real, we should 
be able to detect them using statistical methods. Deng et al. 
(1994) used the second derivative of the integral two-point 
spatial correlation function to search for typical scales in 
the distribution of quasars. They argued that there exists a 
typical scale of about 95/i _1 Mpc and that this scale may 
be related to the specific shape of the initial perturbation 
spectrum. In this paper we present a study of the quasar 
distribution in three deep surveys. We have used statistical 
methods similar to those of Deng et al. (1994). Our goals 
and strategies, however, were somewhat different. First, we 
have chosen to study only the deepest quasar surveys. The 
number density of quasars in these surveys is much higher 
than in the larger surveys which makes them more sensitive 
to the presence of structures at scales 50 — lOO/i -1 Mpc. 
Second, we have planned to look for evidence of individual 
large-scale structures rather than for typical scales in the 
overall quasar distribution (the numbers of objects in the 
deep samples that we used are probably too small for that). 
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The plan of the paper is as follows: in Section 2 we 
present the quasar surveys and briefly discuss the limita- 
tions they impose, in Section 3 we describe the statistical 
methods; in Section 4 we present the results and discuss the 
implications in Section 5. 



2 QUASAR SAMPLES 

The relatively low number density of quasars in existing 
bright quasar surveys makes some of them insensitive to 
spatial correlations at scales < 100/i -1 Mpc (Romberg et al. 
1994). On the other hand, the deepest quasar surveys usually 
have high enough quasar densities (mean separation between 
objects ~ 20 — 40h~ Mpc) to be sensitive to the presence of 
supercluster size (~ lOOft^ 1 Mpc) structures. These surveys 
are usually composed of one or several "beams" sampling 
small areas on the sky. The beam sizes perpendicular to the 
line of sight (~ 30 - 50/T 1 Mpc) are much smaller than the 
sampled distances along the line of sight (~ 2000/i _1 Mpc). 
The "one-dimensional" nature of these surveys makes analy- 
sis and interpretation of the results relatively simple because 
if significant correlations exist, we can then look directly for 
the structures responsible for them. Such analysis is difficult 
for larger surveys because their sky geometry is usually quite 
complicated. However, the presently available deep surveys 
have small numbers of quasars (usually few tens) in a beam, 
which complicates and limits any statistical analysis. One 
then must use statistics suitable for the analysis of small 
data sets. 

For our study we have chosen three quasar surveys pub- 
lished in the literature: i) a survey by Koo & Kron 1988 
(hereafter KK survey), which was compiled using UBVI 
photometry, variability and proper motions of candidates; 
ii) a grism survey by Zitelli et al. 1992 (hereafter (ZM) 2 B); 
and iii) a deep survey by Boyle, Jones & Shanks 1991 (here- 
after BJS) in which quasar selection was based on multicolor 
photometry of the candidates. The BJS survey is composed 
of three beams covering approximately the same area on 
the sky (~ 0.3 sq.deg.). We have, therefore, analyzed these 
three beams (BJS1, BJS2, BJS3) separately. Also, follow- 
ing (ZM) 2 B, their survey can be split into two samples with 
different levels of completeness: a sample of 24 quasars com- 
plete to J = 20.85 in an area of 0.69 square degrees; andW 
sample of 28 quasars complete for 20.85 < J < 22.0 in a( u ) 
circular area of 0.35 square degrees which is contained in 
the previous one. In our analysis we have used a sample of 
38 quasars complete for J < 22. oQ. The basic characteristics 
of all of our samples - surveyed area in square degrees, lim- 
iting magnitude, number of quasars, and redshift interval - 
are presented in Table 1. 

(iii) 

3 METHOD 

In this study, we have used the integral two-point correlation 
function (e.g. Mo et al. 1992; Mo & Fang 1993) which is 



t In this sample we have included all quasars inside the 20-arcmin 
circle denned in (ZM) 2 B. 



Table 1. Surveys used in the analysis. 



Sample 


Area 


Limiting 


Number of 


Redshift 




(sq.deg.) 


magnitude 


quasars 


interval 


KK 


0.29 


B < 22.5 


28 


0.9 - 3.2 


(ZM) 2 B 


0.35 


J < 22.0 


38 


0.4 - 2.8 


BJS1 


0.29 


bj < 21.8 


21 


0.6 - 2.9 


BJS2 


0.33 


bj < 22.0 


19 


0.6 - 2.9 


BJS3 


0.27 


bj < 21.8 


20 


0.6 - 2.9 



related to the usual differential correlation function £(r) as 
follows: 

3 r 

£(r) = — / x 2 (,{x)dx. 
r Jo 

This choice was made because the integral correlation func- 
tion is more stable than £(r) in the analysis of small data 
sets. This is important because we intended to look for fea- 
tures in the correlation function at large scales, where signal- 
to-noise ratio is usually small. We estimate £(r) using a stan- 
dard estimator (e.g. Mo & Fang 1993): 



where LT(r) is the number of pairs with separations less than 
r in an analyzed sample, while H T nd(r) is the corresponding 
number of pairs averaged over an ensemble of random sam- 
ples. To estimate n rnc j(r), a thousand random catalogs were 
created using the smoothing method of Mo & Fang (1993). In 
this method, random samples are constructed by assigning 
each object a random position in the sky within the bound- 
aries of the sample and by drawing redshifts randomly from 
a smoothed version of the original redshift distribution. The 
smoothed redshift distribution is obtained by averaging the 
number of quasars in the redshift interval Az = 0.6 (corre- 
sponding to ~ 500h~ 1 Mpc at z ~ 1.5) around a given red- 
shift. This width of the interval was chosen so that it is small 
enough to preserve the overall survey selection envelope but 
is considerably larger than the scales we are interested in 
(~ lOO/iT 1 Mpc). 

The whole analysis is similar to that of Mo et al. (1992). 
It is outlined in the following series of steps. 

First, we compute the function S(r) = f (r) + 1; 
Then we find the slopes Xi(r) and Tb(r) (or corresponding 
angles Oi and O2) of S(r) for every r bin using linear regres- 
sion of the relation log H(r) — log r in the intervals [r — Ar, r] 
and [r, r + Ar], where Ar is the smoothing scale. Smoothing 
with a given Ar effectively damps out amplitude fluctuations 
in £(r) on scales r < Ar. The choice of Ar is determined 
by the mean separation r between objects in a sample - at 
scales smaller than r the shot noise is significant and must 
be suppressed. 

Finally, we construct the function AO(r) = O2M — Oi(r), 
i.e. the smoothed second derivative of £(r). Significant 
changes in the slope of log H(r) — log r (changes in the shape 
of £(r)) reflect the inhomogeneities in the distribution of 
objects at the corresponding scales. These changes result 
in sharp peaks in AO(r) with scale independent amplitude 
(Mo et al. 1992; Deng et al. 1994). This is very useful when 
working on the large scales where the amplitude of the cor- 
relation function is small. 
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We have estimated rms errors in £(r) using sample-to- 
sample deviations in the random catalogs as well as us- 
ing analytical formula presented by Mo et al. (1992). We 
have constructed n — 1000 random catalogs using the same 
smoothing procedure described above which we then ana- 
lyzed in the same way as real data. In this way we could 
estimate the fluctuation in the number of pairs (and thus 
the fluctuation of £(r)) at a given scale as: 

V n — 1 

here (II) is the number of pairs averaged over all random 
catalogs, and IT is the corresponding number of pairs in i- 
th random catalog. The deviation of A8(r) was estimated 
in a similar way: 

/ ^(Ae, - (AO)) 2 
V n — 1 

These estimates were then compared with the values given 
by the theoretical formulae (Mo et al. 1992). For one- 
dimensional samples they are: 

Nr±Ar = _ (_Arj , *Ae=(^J w 

where L is the sample extent in the redshift direction, N is 
the number of objects in the sample, and Ar is the smooth- 
ing length adopted in the construction of AO(r). We have 
found that errors predicted by these formulae agree well 
with values derived from the sample-to-sample variations. 
We have, therefore, used these formulae to estimate the sig- 
nificance of peaks in A6(r). 

Finally, we have used friend-of- friend cluster analysis 
(e.g. Einasto et al. 1984) to study the quasar distribution in 
the samples directly. The details of the cluster identification 
procedure and the way of estimating the probability for a 
cluster to be random are described in Romberg et al. (1996). 



correlation signal is comparable to that for samples contain- 
ing ten times more quasars (e.g. Mo & Fang 1994; Boyle & 
Shanks 1994). While the number of objects in this sample is 
quite small (28), the high number density of quasars assures 
relatively good statistics of close pairs (see Romberg et al. 
1994 for discussion of close quasar pair statistics). The same 
is true for the (ZM) 2 B sample. 

The most interesting feature of the correlation function 
is a "bump" at ~ 50ft -1 Mpc which is also seen as a sharp 
negative peak in AO(r) (fig. lb). The significance of the 
peak is ~ 2.5a. Using friend-of- friend cluster analysis we 
have found a quartet and two triplets (i.e. 10 quasars out of 
a total of 28) with sizes along the redshift direction of 43, 30, 
and 34ft -1 Mpc correspondingly. The estimated probability 
for these clusters to be random is quite small for the quartet 
and one of the triplets (0.01 and 0.08) while for the second 
triplet it is rather high (~ 0.5). However, the method we use 
to estimate this probability is not very reliable for triplets 
(and does not work at all for pairs) because the uncertainty 
of density within the triplet is high. We have found that the 
excess of pairs at separations r ~ 30 — 50ft -1 Mpc (caus- 
ing the bump in £(r)) is due to the presence of these small 
clusters. 

4.2 (ZM) 2 B 

For this sample the correlation signal at small separations 
is also quite high. The "bump" is present at ~ 110ft _1 Mpc 
(fig. 2a) with a corresponding peak in AO(r) (fig. 2b) which 
is significant at the ~ 2.5a level. Applying the cluster anal- 
ysis, we have found three quartets and two quintets (22 
quasars out of a total of 38). One of the quartets is probably 
random (estimated probability is ~ 0.6), the probability to 
be random for the rest of the clusters is small (< 0.1) - their 
sizes in z-direction lie in the range 25 - 50ft -1 Mpc. Four of 
these clusters form two "pairs" 100 ± 20k- 1 Mpc creating 
the excess of quasar pairs at these separations. 



4 RESULTS 

The statistics described in the previous section were com- 
puted for all five quasar samples. The functions H(r) = 
f(r) + 1 and AO(r) for each sample are plotted in figs. 1 
and 2. On these plots the functions are shown by solid lines 
and one sigma error envelope by the thin dashed lines. The 
features in the correlation function which we discuss be- 
low are indicated by arrows. The mean separations between 
quasars in the samples are ~ 20 — 30ft -1 Mpc for the RR 
and (ZM) 2 B surveys, and ~ 40ft -1 Mpc for the BJS survey. 
This determined the choice of smoothing length for AO(r) 
- 20ft -1 Mpc and 40ft -1 Mpc correspondingly (on the plots 
AO(r) = at r < Ar). The integral correlation function 
was computed with a bin size of 2.5ft -1 Mpc for the RR 
and (ZM) 2 B samples and with a 5ft -1 Mpc bin for the BJS 
samples. Below we describe results for each sample sepa- 
rately. 

4.1 KK 

Quasars in the RR sample show strong correlations at small 
scales r < 10ft -1 Mpc (fig. la). Statistical significance of the 



4.3 BJS1 

The statistics for the first BJS sample are shown in figs. 2a 
and b. There is a rise in H(r) at a separation of ~ 40ft -1 
Mpc. This rise is not reflected in A6(r) because its scale 
is equal to the smoothing scale. Using cluster analysis we 
have detected two clusters in this sample - a triplet and a 
quintet (8 out of a total of 21 quasars). The z-sizes of both 
clusters are ~ 35ft -1 Mpc. The estimated probability to be 
random is 0.05 and 0.01, respectively. Both of these clusters 
contribute to the excess of quasar pairs at separations of 
30 - 40ft -1 Mpc. 

4.4 BJS2 

Results for the BJS2 sample are shown in figs. 2c and d. The 
decay of E(r) at r < 30ft -1 Mpc is caused by the fact that 
the number density of quasars in this sample is considerably 
lower than in RR and (ZM) 2 B and even lower than in BJS1 
and BJS3. This sample therefore just lacks close pairs. The 
correlation function fluctuates considerably at scales ~ 150— 
200ft" 1 Mpc (there is a significant excess of pairs at these 
separations as compared to the random distribution) . These 
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fluctuations correspond to the positive and negative peaks in 
A0(r) at ~ 150ft" 1 Mpc and ~ 180ft" 1 Mpc. Using cluster 
analysis we have found a triplet (z-size ~ 20ft" 1 Mpc), a 
quartet (z-size ~ 20h~ 1 Mpc), and a sextet (z-size ~ 75ft" 1 
Mpc) . The probability that each individual cluster is random 
is smaller than 0.05. Our analysis has shown that the excess 
of pairs at separations 150 — 200ft" 1 Mpc is explained by 
the distance between the quartet and the sextet (~ 180ft" 1 
Mpc). 

4.5 BJS3 

The correlation function for this sample (fig. 2e) has two 
broad "bumps" at scales ~ 100ft" 1 Mpc and ~ 200ft" 1 Mpc. 
There are two negative peaks in the AO(r) corresponding 
to these fluctuations (fig. 2f). Statistical significance of both 
peaks is ~ 2a. The cluster analysis failed to detect any large 
clusters in this sample. However, the distribution of quasars 
in this sample is quite interesting. There are seven rela- 
tively close pairs (distances between quasars in 5 of them 
are less than 30ft" 1 Mpc and in the other two ~ 40ft" 1 
Mpc) separated from each other by either 80 — 100ft" 1 Mpc 
or 180 — 200ft" 1 Mpc. This causes the fluctuations in 3(r) 
at the corresponding scales. 



5 DISCUSSION AND CONCLUSIONS 

The results presented in the previous section suggest that 
the distribution of quasars in the analyzed samples is not 
homogeneous at scales of a few tens of megaparsecs. Many 
quasars belong to clumps of sizes 30 - 70ft" 1 Mpc. The 
clumps are often separated by 100 — 200ft" 1 Mpc, which cre- 
ates a pair excess at the corresponding scales. Qualitatively 
this quasar distribution is very similar to that of CIV absorp- 
tion systems discussed in the recent paper by Williger et al. 
(1996). They have found that their CIV sample contains two 
groups of 7 and 5 absorbers of sizes ~ 43 x 17 x 69ft" 3 Mpc 3 
and ~ 25 x 4 x 53ft~ 3 Mpc 3 (comoving) located at z ~ 2.3 
and z ~ 2.5, respectively. The distance between these two 
groups (~ 50 — 120ft" 1 Mpc) results in "beating" (the pair 
excess) giving rise to the correlation signal at these separa- 
tions (3.5cr significance level). A number of smaller clumps 
were also detected. The similar clusters of CIV absorbers 
were also found in earlier studies by Jakobsen & Perryman 
(1992), Foltz et al. (1993), and Dinshaw & Impey (1996). Re- 
cently, Lespine & Petitjean (1996) presented evidences for 
a coherent structure extended over ~ 80ft" 1 Mpc at * » 2 
in the distribution of metal absorption systems. Although 
the numbers of CIV systems are also small, the similarity 
of the results may suggest that both quasars and CIV ab- 
sorbers may trace the same kind of underlying structures 
in the matter distribution. Unlike Deng et al. (1994) we did 
not observe any evidence for a periodic signal in the function 

AG(r)[j. This may be caused by the small number statistics. 

The clumpy distribution of quasars in the analyzed 
samples is consistent with the recent studies of the quasar 
distribution in the larger samples (Crampton, Cowley & 

t Although A©(r) seems to be periodic, most of the peaks are 
not significant (their height is < 1<t). 



Hartwick 1987, 1989; Clowes & Campusano 1991a, 1991b; 
Graham, Clowes & Campusano 1995; Romberg et al. 1996). 
These samples were found to contain several relatively rich 
(~ 10 — 25 QSOs) groups of quasars with sizes in the red- 
shift direction of ~ 70 — 160ft" 1 Mpc. The small extent of 
the pencil-beam samples perpendicular to the line of sight 
prevents detection of such large groups. However, the de- 
tected smaller clumps can easily be parts of larger systems. 
It would be very interesting to check this by studying larger 
deep samples which are currently underway (e.g. Hall et al. 
1996). 

If quasars and CIV absorption systems trace the matter 
distribution at high redshifts as galaxies or galaxy clusters 
do at low redshifts, their clumpy distribution suggests that 
large-scale inhomogeneities similar to the nearby superclus- 
ters were already distinct at z ~ 1 — 2. This information may 
provide some useful insights into the physics of high red- 
shift Universe. The fact that we see structures at redshifts 
2 ~ 1 — 2 similar to the superclusters at z ~ (Romberg 
et al. 1996), for instance, favors low-density ACDM or low- 
density CDM models in which perturbation amplitude at 
large scales stops growing at z > 1. On the other hand, 
rather high quasar-quasar correlations at small separations 
and high number density contrasts in the detected quasar 
groups may indicate that the distribution of quasars is 
highly biased with respect to the matter distribution. Al- 
though present available surveys are too small to provide a 
statistically reliable estimate of the power spectrum P(k), 
in the future, with bigger quasar samples and better models 
for both QSOs and CIV absorbers, we will be able to get 
useful constraints on the spectrum, and thus on the theo- 
ries of structure formation, for a wide range of scales and 
redshifts (e.g. Romberg & Lukash 1994). 
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FIGURE CAPTIONS 

Figure 1. The integral two-point spatial correlation 
function £(r) for the KK (panel a) and (ZM) 2 B (panel c) 
samples and its smoothed second derivative (panels b and 
d) AO(r) (see definitions in Section 3). The functions are 
solid lines. The la error envelope is shown by thin dashed 
lines. The features in the correlation function discussed in 
the text (Section 4) are indicated by arrows. 

Figure 2. The integral two-point spatial correlation 
function £(r) for the BJS1, BJS2, and BJS3 samples (panels 
a,c, and e, respectively) and its smoothed second derivative 
AO(r) (panels b,d, and f). The structure of the plots is the 
same as for Fig.l. 
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